COST FUNCTIONS FOR SUPERVISED LEARNING BASED ON A ROBUST SIMILARITY METRIC By ABHISHEK SINGH A THESIS PRESENTED TO THE GRADUATE SCHOOL OF THE UNIVERSITY OF FLORIDA IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF MASTER OF SCIENCE

نویسندگان

  • Abhishek Singh
  • John G. Harris
  • John M. Shea
چکیده

of Thesis Presented to the Graduate School of the University of Florida in Partial Fulfillment of the Requirements for the Degree of Master of Science COST FUNCTIONS FOR SUPERVISED LEARNING BASED ON A ROBUST SIMILARITY METRIC By Abhishek Singh May 2010 Chair: José C. Prı́ncipe Major: Electrical and Computer Engineering This thesis proposes cost functions for supervised learning algorithms, based on a robust measure of similarity between random variables, called correntropy. A supervised learning system typically consists of a functional mapper, whose parameters are trained in a way such that its outputs are as close as possible to a set of desired or target values, for a given input dataset. Traditionally, the mean squared value of the error between the output and desired is used as a cost function in several supervised learning applications. We propose to train the function parameters by maximizing the similarity between the output and the desired values in the correntropy sense. We show that correntropy as a cost function is as simple to compute as the mean squared value, and also has the robustness properties of more elaborate cost functions like the entropy of the error. It is therefore a practical way of getting robust results in online applications like adaptive filtering, as corroborated by our results. Further, we propose a way of enhancing the training performance by adapting from data the kernel width parameter that is used in the estimator for correntropy (and entropy). This thesis also proposes a correntropy induced loss function for classification, called the C-Loss function. It can be shown that correntropy is a measure of probability of zero error between the true label and the predicted label. It is therefore quite a natural quantity to optimize in order to train a classifier. We show that the discriminant function obtained by optimizing the C-Loss function using a neural network is robust to overfitting

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Composite Kernel Optimization in Semi-Supervised Metric

Machine-learning solutions to classification, clustering and matching problems critically depend on the adopted metric, which in the past was selected heuristically. In the last decade, it has been demonstrated that an appropriate metric can be learnt from data, resulting in superior performance as compared with traditional metrics. This has recently stimulated a considerable interest in the to...

متن کامل

REAL-TIME TRANSACTION SCHEDULING: A COST-CONSCIOUS APPROACH By DONG-KWEON HONG A THESIS PRESENTED TO THE GRADUATE SCHOOL OF THE UNIVERSITY OF FLORIDA IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF MASTER OF SCIENCE UNIVERSITY OF FLORIDA

of Thesis Presented to the Graduate School of the University of Florida in Partial Ful llment of the Requirements for the Degree of Master of Science REAL-TIME TRANSACTION SCHEDULING: A COST-CONSCIOUS APPROACH By DONG-KWEON HONG DECEMBER, 1992 Chairman: Dr. Sharma Chakravarthy Cochairman: Dr. Theodore Johnson Major Department: Computer and Information Sciences Real-time databases are an increas...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010